When rule-based models need to count
نویسندگان
چکیده
Rule-based modelers dislike direct enumeration of cases when more efficient means of enumeration are available. We present an extension of the Kappa language which attaches to agents a notion of level. We detail two encodings that are more concise than the former practice. Rule-based languages are a well-established framework for modeling proteinprotein interactions. Kappa [2,1] is a rule-based language relying on site-graphs. The nodes of sitegraphs are called agents. Agents interact by binding/unbinding through sites. Sites are binding resources, each site is part of at most one edge. A model in Kappa consists of a set of graph rewrite rules with rates. A rule describes a potential interaction given a context. Rates represent probability to fire. In a biological context, it is often the case that a notion of internal state (such as active, methilated, . . . ) is required in order to describe possible interactions. In Kappa, sites are equipped with internal states, facilitating the modeling efforts of the user. However, as shown in the following sections a more systematic encoding of internal states is possible. Another common practice is to attach a level to agents and make an interaction sensitive on the level of its participating agents. We propose here a language extension to store, test and change levels explicitly. Moreover, we present an encoding of levels that induce a linear (in number of levels) blow up of the number of rules. This is in contrast to previous encodings, which induce an exponential blow up in the number of rules. 1 Email: Pierre [email protected] 2 Email: Ioana [email protected] This paper is electronically published in Electronic Notes in Theoretical Computer Science URL: www.elsevier.com/locate/entcs ar X iv :1 70 8. 02 65 1v 1 [ qbi o. O T ] 7 A ug 2 01 7 Boutillier, Cristescu 1 When enumeration is necessary The following motivating example [4] demonstrates a typical problem in which levels are necessary: KaiC proteins have 6 independent phosphorylation sites. (De)phosphorylation of every site is independent. The more sites are phosphorylated, the bigger the probability that KaiC binds KaiA is. A typical way to deal with this example consist in explicitly encoding rules for the internal states of the sites of interest. However, doing so induces an exponential blow-up in the number of rules (in the number of levels). The BNGL [3] language introduces a notion of indistinguishable sites. i.e. one can define an agent with n sites that all have the same name. Consequently, a single rule to specify that k sites (out of n) are phosphorylated is enough. Moreover, the number of species is also reduced. Still, there are exponentially many ways to go from one level to another and enumeration is necessary to faithfully respect the dynamics of the system. We now show that adding a syntactic layer to Kappa that offers support for counting will avoid the explosion in the number of rules.
منابع مشابه
Fitting of Count Time Series Models on the Number of Patients Referred to Addiction Treatment Centers in Semnan County
Abstract. Count data over time are observed in many application areas. Many researchers use time series patterns to analyze this data. In this paper, the poisson count time series linear models and negative binomials on this type of data with the explanatory variables are studied. The Likelihood analysis and the evaluation of count time series model based on generalized linear models are pres...
متن کاملInvestigating the missing data effect on credit scoring rule based models: The case of an Iranian bank
Credit risk management is a process in which banks estimate probability of default (PD) for each loan applicant. Data sets of previous loan applicants are built by gathering their data, and these internal data sets are usually completed using external credit bureau’s data and finally used for estimating PD in banks. There is also a continuous interest for bank to use rule based classifiers to b...
متن کاملRule-based of Monetary Policy in Iran Inspired by McCallum Rule
Economists have reached a consensus that an independent central bank could improve its policy efficiency by following a monetary policy rule. One of the important rules is McCallum rule where that requires central banks to target the growth rate of nominal GDP using the monetary base as its instrument. One of the features of the McCallum rule uses the monetary base rather than the interest rate...
متن کاملSpatial count models on the number of unhealthy days in Tehran
Spatial count data is usually found in most sciences such as environmental science, meteorology, geology and medicine. Spatial generalized linear models based on poisson (poisson-lognormal spatial model) and binomial (binomial-logitnormal spatial model) distributions are often used to analyze discrete count data in which spatial correlation is observed. The likelihood function of these models i...
متن کاملUsing multivariate generalized linear latent variable models to measure the difference in event count for stranded marine animals
BACKGROUND AND OBJECTIVES: The classification of marine animals as protected species makes data and information on them to be very important. Therefore, this led to the need to retrieve and understand the data on the event counts for stranded marine animals based on location emergence, number of individuals, behavior, and threats to their presence. Whales are g...
متن کاملControl Chart Recognition Patterns using Fuzzy Rule-Based System
Control Chart Patterns (CCPs) recognition is one the most important concepts in control chart application. Relating the patterns exhibited on the control chart to assignable causes is an ambiguous and vague task especially when multiple patterns co-exist. In this study, a fuzzy rule-based system is developed for X ̅ control charts to prioritize the control chart causes based on the accumulated e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1708.02651 شماره
صفحات -
تاریخ انتشار 2017